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Abstract 
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a balance condition between the effective growth rate of incumbent firms (which includes 
their possible demise) and the growth rate of investments in entrant firms. Remarkably, 
Zipf 's law is the signature of the long-term optimal allocation of resources that ensures the 
maximum sustainable growth rate of an economy. 
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1 Introduction 



The relevance of power law distributions of firm sizes to help unde rstand firm and economic 



Schumpeted (119341) . who proposed that there 



growth has been recognized early, for instance by 
might be important links between firm size distributions and firm growth. The endogenous and 
exogenous processes and factors that combine to shape the distribution of firm sizes can be 
expected to be at least partially revealed by the characteristics of the distribution of firm sizes. 
The dis tribution of firrn sizes has also attracted a great deal of attention in the recent policy 



debate (Eurostat. 



(Davis et al 



1998L for instance), because it may influence job cre ation and destruction 



1996h . the response of the economy to monetary shocks (|Gertler and Gilchrist! 



19941) and might even be an important de terminant of productivity growth at the macroeconomic 



level due to the role of market structure (jPerettd, 



1999h . 



19991 : 



Pagano and SchivardiL 



2003 



Acs et al 



This article presents a reduced form model that provides a generic explanation for the ubiq- 
uitous stylized observation of power law distributions of firm sizes, and in particular of Zipf 's 
law - i.e., the fact that the fraction of firms of an economy whose sizes S are larger than s is 
inversely proportional to s: Ft{S > s) ~ s^™, with m equal (or close) to 1. We consider an 
economy made of a large number of firms that are created according to a random birth flow, 
disappear when failing to remain above a viable size, go bankrupt when an operational fault 
strikes, and grow or shrink stochastically at each time step proportionally to their current sizes 
(Gibrat law). 

Our contribution to the ongoing debate on the shape of the distribution of firms' sizes is to 
present a theory that encompasses previous approaches and to derive Zipf 's law as the result of 
the combination of simple but realistic stochastic processes of firms' birth and death together 



with Gibrat 's law ( Gibrati 



193 1|) . The main result of our approach is that Zipf 's law is associated 



with a maximum sustainable growth of investments in the creation of new firms. In this respect, 
the size distribution of firms appears as a device to assess the efficiency and the sustainability 
of the resources allocation process of an economy. Another interesting aspect of our framework 
is the analysis of deviations from the pure Zipf's law (case m = 1) under a variety of circum- 
stances resulting from transient imbalances between the average growth rate of incumbent firms 
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and the growth rate of investments in new entrant firms. These deviations from the pure Zipf 's 
law have been documented for a variety of firm's size proxies (e.g. sales, inco mes, number o ' 



employees, or tota^ 



1977 



Suttonl 



1997 



assets). 



Axtell . 



and reported values for m ranges from 0.8 to 1.2 (|ljri and Simon . 



2001 



among many others). Our approach provides a framework for 



identifying their possible (multiple) origins. 

In the literature on the growth dynamics of business firms, a well established tradition de- 
scribes the change of the firm's size, over a given period of time, as the cumulative effect of a 
num ber of different shocks originated by the diverse accidents that affected the firm in that pe- 



riod (|Kaleckil . 



1945 



Ijri and Simon , 



1977 



Steindl . 



1965 



Suttonl 



1998 



Geros 



2OOOL among 



others). This, together with Gibrat's law of proportional growth, forms the starting point for 
various attempts to explain Zipf 's law. However, these attempts generally start with the implicit 



or explicit assumptio n that t 



time and live forever (|Gibrati . 



l e set of firms under consideration was born at the same o rigin of 



1931 



Gabaixl . 



19991 : 



Rossi-Hansberg and Wright! 



2007alJbh . This 



approach is equivalent to considering that the economy is made of only one single firm and that 
the distribution of firm sizes reaches a steady-state if and only if the distribution of the size of 
a single firm reaches a steady state. This latter assumption is counterfactual or, even worse, 
non-falsifiable. 

An alternative approach to model a stationary distribution of firm sizes is to account for the 
fact that firms do not all appear at the same time but are bom acco rding to a mor e or less regular 



flow of newly created firms, a s suggested by t 



he common sense 



SimonI (1955) was the first to 



address this question (see also lljri and SimonI (Il977|) ). He proposed to modify Gibrat's model by 



accounting for the entry of new firms over time as the overall industry grows. He then obtained a 
steady-state distribution of firm sizes with a regularly varying upper tail whose exponent m goes 
to one from above, in the limit of a vanishingly small probability that a new firm is created. This 
situation is not quite relevant to explain empi rical data, i nsofar as the converge nce toward the 



steady-state is then infinitely slow, as noted by 



KrugmanI (|1996|) . More recently. 



Gabaix 



( 1999 ) 



allowed for birth of new entities, with the probability to create a new entity of a given size 
being proportional to the current fraction of entities of that size and otherwise independent of 
time. In fact, this assumption does not reflect the real dynamics of firms' creation. For instance. 



See Dunne et al. I (ll988h .l [Reynolds et all (119941) or lBonaccorsi Di Patti and DeirAriccial (120041) . among many 
others, for "demographic" studies on the populations of firms. 
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Bartelsman et alj (l2005b document that entrant firms have a relatively small size compared with 
the more mature efficient size they develop as they grow. It seems unrealistic to expect a non- 
zero probability for the birth of a firm of very large si ze, say, of size comparable to the largest 



Luttmerl ( 2007 )'s model is more realistic 



capitalization currently in the markeo In this respect, 
than Gabaix's, (who anyway models city sizes rather than firms) insofar as it considers that 
entrant firms adopt a scaled-down version of the technology of incumbent firms and therefore 
endogenously set the size of entrant firms as a fraction of the size of operating firms. In this 
article, we partly follow this view and consider that the size of entrant firms is smaller than the 
size of incumbent firms. But we depart from Luttmer's because the size of new entrants is not 
endogenously fixed in our model. We set this parameter exogenously for versatility reasons. 

Another crucial ingredient characterizes our model. The fact that firms can go bankrupt 
and disappear from the economy is a crucial observation that is often neglected in models. 
Many firms are known to undergo transie nt periods of decay which, when persistent, may ul 



timately lead to their exit frorn business (IBonaccorsi Pi Patti and DeirAricciaL 



20051 : 



Brixy and Grotd . 



2007 



Simon 



2004: 



Knaur . 

Bartelsman et al.. 2005) . Simonl(ll960l) as we ll as ,SteindL (,19651) 



(119551) where th e decline 
Simon ( 196oi 's model. 



have considered this stylized fact within a generalization of 
of a firm and ultimately its exit occurs when its size reaches zero. In 
the rate of firms' exit exactly compensates the flow of firms' births so that the economy is sta- 
tion^ and t he steady-stat e distribution o f firm sizes exhibit the same upper tail behavior as in 



Simo nl(|1955). In contrast. iSteindll (| 1965|) includes births and deaths but within an industry with 



a growing number of firms. A steady-state distribution is obtained whose tail follows a power 
law with an exponent that depends on the net entry rate of new firms and on the average growth 
rate of incumbent firms. Zipf's law is only recovered in the limit where the net entry rate of 
new firms goes to zero. Both models rely on the existence of a minimum size below which a 
firm runs out of business. This hypothesis corresponds to the existence of a minimum efficient 
size below which a firm cannot operate, as is well established in economic theory. However, 
there may be in general more than one minimum size as the exit (death) level of a firm has 
no reason to be equal to the size of a firm at birth. In the afore mentioned models, these two 
sizes are assumed to be equal, while there is a priori no reason for such an assumption and 



^We do not consider spin-off's or M&A (mergers and acquisitions). 
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empirical evidence a contrario. In our model, we allow for two different thresholds, the first 
one for the typical size of entrant firms and the second one for the exit level. This second level 
is assumed to be lower than the first one, even if recent evide nce seems to suggest that firms 



20011) 



might enter with a size less than their minimum efficient size (|Agarwal and Audretschl 
and then rapidly grow beyond this threshold in order to survive. 

In addition to the exit of a firm resulting from its value decreasing below a certain level, it 
sometimes happens that a firm encounters financial troubles while its asset value is still fairly 
high. One could cite the striking examples of Enron Corp. and Worldcom, whose market cap- 
italization were supposedly high (actually the result of inflated total asset value of about $11 
billion for Worldcom and probably much higher for Enron) when they went bankrupt. More 
recently, since mid-2007 and over much of 2008, the cascade of defaults and bankruptcies (or 
near bankruptcies) associated with the so-called subprime crisis by some of the largest financial 
and insurance companies illustrates that shocks in the network of inter-dependencies of these 
companies can be sufficiently strong to destabilize them. Beyond these trivial examples, there 



is a large empirical literature on firm entries and exits, that suggests t 



count the existence of failure of large firms (IDunne et al. 



1988 



l e need for taking i nto ac 



19891 : 



Bartelsman et al. . 



2005 ). 



To the extent that the empirical literature documents a sizable exit at all size categories, we 
suggest that it is timely to study a model with both firm exit at a size lower bound and due to a 
size-independent hazard rate. Such a model constitutes a b etter approxim ation to the empirical 
data than a model with only firm exit at the lower bound. iGabaixi (|1999.) briefly considers an 
analogous situation (at least from a formal mathematical perspective) and suggests that it may 
have an important impact on the shape of the distribution of firm sizes. 

To sum up, we consider an economy of firms undergoing continuous stochastic growth 
processes with births and deaths playing a central role at time scales as short as a few years. We 
argue that death processes are especially important to understand the economic foundation of 
Zipf 's law and its robustness. In order to make our model closer to the data, we consider two 
different mechanisms for the exit of a firm: (i) when the firm's size becomes smaller than a given 
minimum threshold and (ii) when an exogenous shock occurs, modeling for instance operational 
risks, independently of the size of the firm. The other important issue is to describe adequately 
the birth process of firms. As a counterpart to the continuously active death process, we will 
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consider that firms appear according to a stochastic flow process that may depend on macro- 
economic variables and other factors. The assumptions underpinning this model as well as the 
main results derived from it are presented in section 2. Section 3 puts them in perspective in 
the light of recent theoretical models and empirical findings on the existence of deviations from 
Zipf's law. Section 4 provides complementary results which are important from an empirical 
point of view. All the proofs are gathered in the appendix at the end of the article. 



2 Exposition of the model and main results 



2.1 Model setup 



We consider a reduced form model, with a first set of three assumptions, in which firms are cre- 
ated at random times tj's with initial random asset values Sg's drawn from some given statistical 
distribution. More precisely: 

Assumption 1. There is a flow of firm entry, with births of new firms following a Poisson 
process with exponentially varying intensity iy{t) = uq ■ e*^ *, with d G M; 

This assumption generalizes most previous appro aches that add ress t he question of model- 



Gabaixl ( 1999 )orin 



ing the size distribution of firms. In the basic model of 
(|2007allb|) . all firms (or cities) are supposed to enter at the same time, which is technical 
alent t o consider that there is only one firm in the economy. In Simon's models and in 



Rossi-Hansberg and Wright 



V equiv- 



Luttmei 



(120071), a flow of firms birth is considered, but births occur deterministically at discrete time 
steps (Simon) or continuously in time (Luttmer). Assumption [T] allows for a random flow of 
birth. 

As will be clear later on, the value of the parameter uq is not really relevant for the un- 
derstanding of the shape of the distribution of firm sizes. In contrast, the parameter d, which 
characterizes the growth or the decline of the intensity of firm births, plays a key role insofar as 
it is directly related to the net growth rate of the population of firms. 

We also assume that the entry size of a new incumbent firm is random, with a typical size 
which is time varying in order to account for changing installment costs, for instance. The size 



of a firm can represent its assets value, but for most of tlie developments in this article, the size 
could be measured as well by the number of employees or the sales revenues. 

Assumption 2. At time ti, i E N, the initial size of the new entrant firm i is given by Sq = 
sqa ■ 6*^°*% Co G M. The random sequence {so,i}.gpj is the result of independent and identically 
distributed random draws from a common random variable sq. All the draws are independent 
of the entry dates of the firms. 



This assumption e xogenous l y sets the size of entrant firms. It departs from lGabaixl (| 19991) 



generalized model and 



Luttmen (|2007|) model by considering a distribution of initial firm sizes 



that is unrelated to the distribution of already existing firms. Besides , it does no t impo ses that 



Steindl 



(119651) which 



all the firms enter with the same (minimum) size, as in Simon (.19601') or 
are retrieved by choosing a degenerated distribution of entrant firms and Cq = 0. As we shall see 
later on, apart from the growth rate Cq of the typical size of a new entrant firm, the characteristics 
of the distribution of initial firm sizes is, to a large extent, irrelevant for the shape of the upper 
tail of the steady-state distribution of firm sizes. 

Remark 1. As a consequence of assumptions [H and [2l the average capital inflow per unit time 
- i.e. the average amount of capital invested in the creation of new firms per unit time - is 



dl{t) 



u{t)E [So] e"°* dt , 
z/qE [So] 6^^^+"°)* dt 



(1) 
(2) 



and d + Co appears as the average growth rate of investment in new firms. 



As usual, we also assume that 



Assumption 3. Gibrat's rule holds. 



Assumption [3] means that, in the continuous time limit, the size Si{t) of the i*'* firm of the 
economy at time t > ti, conditional on its initial size Sq, is solution to the stochastic differential 
equation 

dSiit) = S^it) ifidt + adWiit)) , t>ti, Siiti) = 4 . (3) 
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The drift /i of the process can be interpreted as the rate of return or the ex-ante growth rate of 
the firm. Its volatility is a and Wi{t) is a standard Wiener process. Note that the drift fx and the 
volatility a are the same for all firms. 



Simonl 's model by allowing the creation 



This assumption together with assumption[T]extendsb 
of new firms at random times, as already mentioned, and more importantly decouples the growth 
process of existing firms from the process of creation of new firms. It thus makes the model 
more realistic. 



to 



Let us now consider two exit mechanisms. 



Dased on the following empirical facts. Referring 



Bonaccorsi Di Patti and Dell'Ariccial (|2004|) . the yearly rate of death of Italian firms is, on 



average, equa l to 5.7% with a maximum of about 20% for some specific industry branches. 
Knaupl (120051) examined the business survival characteristics of all establishments that started 
in the United States in the late 1990s when the boom of much of that decade was not yet 
showing signs of weakness, and fi nds that, if 85% of firrn s survive more than one year, only 



45% survive more than four years. 



Brixy and Grota (|2007|) analysed the factors that influence 



regional birth and survival rates of new firms for 74 West German regions over a 10-year period. 
They documented significant regional factors as well as variability in time: t he 5 -year survival 



Bartelsman et al. 



rate fl uctuates between 45% and 51% over the period from 1983 to 1992. 
(120051) confirmed that a large number of firms enter and exit most markets every year in a group 
of ten OECD countries: data covering the first part of the 1990s show the firm turnover rate 
(entry plus exit rates) to be between 15 and 20 percents in the business sector of most countries, 
i.e., a fifth of firms are either recent entrants, or will close down within the year. 

First of all, we assume that firms disappear when their asset values become smaller than 
some pre-specified minimum level s^i^. 

Assumption 4. There exists a minimum firm size Smm(^) = ■ e^^^ *, that varies at the constant 
rate Ci < Cq, below which firms exit. 



This idea has been considered in several models of firm growth (see e.g. Ide Witi (|2005|) 
and references therein) and can be related to the existence of a minimum efficient size in the 
presence of fixed operating costs. Besides, as for the typical size of new entrant firms, we 
assume that the minimum size of incumbent firms grows at the constant rate Ci > 0, so that 
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'Smin(i^) := Sie'^^ *. But ci is a priori different from cq. It is natural to require that the lower 
bound Sq of the distribution of sq be larger than si and that cq > ci in order to ensure that no 
new firm enters the economy with an initial size smaller than the minimum firm size and then 
immediately disappear]^. The condition sic"^^ * < Sqc"^" * implies that the economy started at a 
time to larger than 

U = ^—-ln(^]<0. (4) 



We could alternatively choose Sq = 
approach, suggested for instance by 



Ci - Co \Si 

_sj_so tha^Ae economy starts at time t = 0. Another 



Gabaixl (|1999|) . considers that firms cannot decline below 



a minimum size and remain in business at this size until they start growing up again. Here, we 
have not used this rather artificial mechanism. 

Secondly, we consider that firms may disappear abruptly as the result of an unexpected 
large event (operational risk, fraud,...), even if their sizes are still large. Indeed, while it has 
been established that a first-order characterization for firm death involves lower failure rates 



for larger firms (IDunne et al 



1988L 



19891), 



Bartelsman et al. 



(|2005h also state that, for suffi- 



ciently old firms, there seems to be no difference in the firm failure rate across size categories. 
Consequently 

Assumption 5. There is a random exit of firms with constant hazard rate h > max{— rf, 0} 
which is independent of the size and age of the firm. 

Remark 2. As will become clear later on, the constraint h > max{—d, 0} is only necessary 
to guaranty that the distribution of firm sizes is normalized in the small size limit if there is no 
minimum firm size. The case d > ensures that the population of firms grows at the long term 
rate d while the case o? < allows describing an industry branch that first expands, then reaches 
a maximum a nd eventually declines at the rate d. Such a situation is quite realistic, as illustrated 



by figure 2 in 



SuttonI (119971) which depicts the number of firms in the U.S. tire industry. Notice, 



in passing, tha t the case /i < is also sensible. It corresponds to the situation considered by 



Gabaix ( 



19991) in his generalized model, where firms are allowed to enter with an initial size 



randomly drawn from the size distribution of incumbent firms. 



^In fact, it seems that the typical size of entrant firms is much smaller than the minimum efficient size 
dAgarwal and Audretschi 1200 ll and references therein). It means that two exit levels should be considered; one 



for old enough firms and another one for young firms. For tractability of the calculations, we do not consider this 
situation. 



Under as sumptions [Hand [51 i.e. not considering for the time being the mechanism of exit of 
firms at the minimum size, the average number Nt of operating firms satisfies 



dNt 
dt 



+ hNt = u{t), 



(5) 



so that, assuming that the economy starts at t = for simplicity, we obtain 



Nt 



^0 



d + h 



(6) 



Consequently, the rate of firm birth, given by i'{t)/Nt, is given by ^_jLld+h)-t d + h for t 
large eno ugh. The range of va lues ofd + h has been reported in many empirical studies. For 



instance. 



Reynolds et al 



(119941) give the regional average firm birth rates (annual firm births per 
100 firms) of several advanced countries in different time periods: 10.4% (France; 1981-1991), 
8.6% (Germany; 1986), 9.3% (Italy; 1987-1991), 14.3% (United Kingdom; 1980-1990), 15.7% 
(Sweden; 1985-1990), 6.9% (United States; 1986-1988). They also document a large variability 



from o ne industr ial sector to another . More interestingly. 



Bonaccorsi Di Patti and Dell'Ariccia 



2004 ) as well as 



Dunne et al. 



(|1988l) reports both the entry and exit rate for different sectors in 
Italy and in the US respectively. In every cases, even if sectorial differences are reported, the 
average aggregated entry and exit rates are remarquably close. This suggests that d should be 
close to zero while h is about 4 — 6%. The net growth rate of the population of firms, given by 



1 dNt 
Nt dt 



Nt 



h tends to d for t large enough, as announced after assumption [IJ 



2.2 Results 



Equipped with this set of five assumptions, we can now define 



m := - 
2 



1 - 2 • 




and derive our main result (see appendix lA.il for the proof): 
Proposition 1. Under the assumptions 1-5, provided that E [s™] < oo, 



for t — t^':^ 



(^fi - ^ - + 2a\d + h) 



.-1/2 



(7) 



, the average distribution of firm's sizes fol- 
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lows an asymptotic power law with tail index m given by ([7|), in the following sense: the average 
number of firms with size larger than s is proportional to s~"^ as s ^ oo. 

Remark 3. Condition E [5™] < oo in Assumption |2] means that the fatness of the initial distri- 
bution of firm sizes at birth is less than the natural fatness resulting frorn the random growth. 



Such an assumption is not always satisfied, in particular in 



Luttmei 



(l2007h 's model where, due 



to imperfect imitation, the size of entrant firms is a fraction of the size of incumbent firms. 

One can see that the tail index increases, and therefore the distribution of firm sizes becomes 
thinner tailed, as /i decreases and as h, cq, and d increase. This dependence can be easily 
rationalized. Indeed, the smaller the expected growth rate /x, the smaller the fraction of large 
firms, hence the thinner the tail of the size distribution and the larger the tail index m. The 
larger h, the smaller the probability for a firm to become large, hence a thinner tail and a larger 
m. As for the impact of cq, rescaling the firm sizes by e"^" *, so that the mean size of entrant 
firms remains constant, does not change the nature of the problem. The random growth of firms 
is then observed in the moving frame in which the size of entrant firms remains constant on 
average. Therefore, the size distribution of firms is left unchanged up to the scale factor e'^°'^. 
Since the average growth rate of firms in the new frame becomes /i' = fi — cq, the larger cq, 
the smaller fi', hence the smaller the probability for a firm to become relatively larger than the 
others, the thinner the tail of the distribution of firm sizes and thus the larger m. Finally, the 
larger d is, the larger the fraction of young firms, which leads to a relatively larger fraction of 
firms with sizes of the order of the typical size of entrant firms and thus the upper tail of the size 
distribution becomes relatively thinner and m larger. 

As a natural consequence of proposition[il we can assert that 

Corollary 1. Under the assumptions ofpropositionU} the mean distribution of firm sizes admits 
a well-defined steady-state distribution which follows Zipf's law (i.e. m = 1) if, and only if 

fi — h = d + Co . (8) 

Remark 4. In an economy where the amount of capital invested in the creation of new firms is 
constant per unit time, namely 

uit) ■ soit) = const. , (9) 
11 



we necessarily get d + cq = so that the balance condition reads n = h. 



To get an intuitive meaning of the condition in corollary [H let us state the following result 
(see the proof in appendix I A. 2|) : 

Proposition 2. Under the assumptions of propositionUl the long term average growth rate of 
the overall economy is max {n — h^d + cq}. 

The term d + Cq quantifies the growth rate of investments in new entrant firms, resulting 
from the growth of the number of entrant firms (at the rate d) and the growth of the size of 
new entrant firms (at the rate Cq). The term d reflects several factors, including improving pro- 
business legislation and tax laws as well as increasing entrepreneurial spirit. The latter term 
Co is essentially due to time varying installment costs, which can be negative in a pro-business 
economy. 

The other term jj, — h represents the average growth rate of an incumbent firm. Indeed, 
considering a running firm at time t, during the next instant dt, it will either exit with probability 
h ■ dt (and therefore its size declines by a factor —100%) or grow at an average rate equal to 
11 ■ dt, with probability (l — h-dt). The coefficient /i can be called the conditional growth rate of 
firms, conditioned on not having died yet. Then, the expected growth rate over the small time 
increment dt of an incumbent firm is (fi — h)-dt + {dt^). As shown by the following equation, 
drawn from appendix lA. 21 the average size of the economy VL{t) (if we neglect the exit of firms 
by lack of a sufficient size) reads 



where I{t) is the average capital inflow invested in the creation of news firms per unit time (see 
eq. O. Thus fx — his also the return on investment of the economy. 

Thus, the long term average growth of the economy is driven either by the growth of invest- 
ments in new firms, whenever d + cq > /i — /i, or by the growth of incumbent firms, whenever 
fjL — h > d + Cq. The former case does not really make sense, on the long run. Indeed, it 
would mean that the growth of investments in new firms can be sustainably larger than the rate 
of return of the economy. Such a situation can only occur if we assume that the economy is 




(10) 
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fueled by an inexhaustible source of capital, which is obviously unrealistic. As a consequence, 
it is safe to assume fi — h > d + cq on the long run. The regime d + cq > n — h might how- 
ever describe tran sient bubble regimes developing under unsustainably large capital creation 



(|Baily et al.L 



2008 ). 



Proposition 3. In a growing economy whose growth is driven by that of incumbent firms, the 
tail index of the size distribution is such that m < 1. 

Along a balanced growth path, which corresponds to a maximum sustainable growth rate of the 
investment in new firms, the tail index of the size distribution is equal to one. 

Proposition [3] shows that Zipf's law characterizes an efficient and sustainable allocation of 
resources among the firms of an economy. Any deviation from it is the signature of an ineffi- 
ciency and/or an unsustainability of the allocation scheme. In this respect, the size distribution 
of firms is a diagnostic device to assess the efficiency and the sustainability of the allocation of 
resources among firms in an economy. 

Proof. According to the natural assumption that the growth of the economy is driven by the 
growth of incumbent firms, i.e. ji — h > c? + cq, we get d + h < fi — co and e < 1 which leads to 
m < 1 (see illustration on figure [T]); we have used assumption [5] according to which d + h > 
hence /i — Cq > 0. On a balanced growth path, both investments in new firms and incumbent 
firms grow at the same rate fx — h = d + cq, hence the growth rate of the investment in new 
firms is maximum and by corollary [T] the tail index m of the size distribution equals one. □ 

In the present framework, the crucial parameters d,Co, ^ and h are exogenous. While this 
is beyond the scope of the present paper, we can however surmise that, within an endogenous 
theory in which the growth of investments would be naturally correlated with the growth of 
the firms in the economy because the success of firms generates the cash flow at the source of 
new investments, the balance growth condition ([8]) appears almost unavoidable for a sustainable 
development. It is quite remarkable that Zipf's law derives as the robust statistical translation 
of this balance growth condition. 

Remark 5. Our theory suggests two simple explanations for the empirical evidence that the 
exponent m is close to 1 . Either the investment in new firms is close to its maximum sustainable 
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Figure 1: The figure shows the exponent m of the power law tail of the distribution of firm 
sizes, given by ([7]), as a function of ^r^, for different values of the ratio e := Bottom to 
top e 



0.6; 0.8; 1; 1.2; 1.6. 



level so that the balance condition is approximately satisfied, or the volatility a of incumbent 
firms sizes is large. Indeed, according to equation (|7]), the tail index m goes to one as o goes to 
infinity irrespective of the values of the parameters d, ix,co and h. In fact, the larger the volatility, 
the larger the tolerance to the departure from the balance condition. Indeed, expanding relation 
(|7]) for cr large, we get 



m = l- 2- ~ ^ ~ ^0 ~ ^ _^ 4 . {d + h){fi-h-Co-d) ^ ^ f ]_ 



(11) 



and for small departures from the balance condition 



m = 1- 



jj, — Cq — h — d 



)d+h 



/-r2 



(1 + 2^^ 



2 ) 



ji — Cq — h — d 



+0 {{^i-co-h- df) 
(12) 



When the volatility changes, the convergence of the size distribution toward its long-term 
distribution may be faster or slower. Indeed, according to Proposition [H the size distribution 
converges to a power law when the age of the economy is large compared with 
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2 1 

— ^ — Co) + 2cr^((i + h) . This quantity is a decreasing function of the volatility 

if (and only if) ^ > {fi — cq) — 2 {d + h) . Therefore, when the volatility is large Zipf's be- 
comes more robust and the convergence towards Zipf's law is faster. 

Remark 6. The regime where m < 1, which predicts an infinite mean size, seems to violate the 
constraint that there is a finite amount of capital (or, employees) in the economy. This suggests 
that the associated parameter ranges are just not possible in actual economies. Actually, the 
regime m < 1 is perfectly possible, as least in an intermediate asymptotic regime. Indeed, a 
real economy which grows at a non- vanishing growth rate bounded by zero from below is finite 
only because it has a finite age. As explained in section 14.21 the distribution of firm sizes in 
such a finitely lived economy (arguably representing the real world) is characterized by a power 
law regime with exponent m as given by Proposition 1, crossing over to a faster decay at very 
large firm sizes. The cross-over regime occurs for larger and larger firm sizes as the age of 
the economy increases. There is thus no contradiction between the finiteness of the amount of 
capital in the economy and the power law with exponent m < 1 up to an upper domain, so 
that the mean does exist. In other words, the paradox is resolved by correctly ordering the two 
limits: (i) limit of larger firm sizes lim5__i.+oo; (ii) limit of large age of the economy lim6i_s.+oo- 
The correct ordering for a finite and long-lived economy is lime_>.+oolim5_j.+oo5 which means 
that, taking the limit of large firm sizes at fixed large but finite age 9 leads to a finite mean, 
coexisting with a power law intermediate asymptotic with exponent m given by Proposition 1 . 



2.3 Calibration to empirical data 



According to Dunne et al.l (|l988l . table 2), the relative size of entrant firms to incumbent firms 
seems to have slightly declined during the period 1963-1982 in the US. According to our model, 
the ratio of the average size of entrant firms to the average size of incumbent firms is, for large 
enough time t. 



sq ■ e 



co-t 



fi—h~d—CQ 
d+h 

1 1 
d+h ' t ' 

d+h— fi+CQ 
d+h ' 



■ e 



^{ti-h-d-coyt^ provided that /i - /i > ci + Co, (a) 

provided that yU — /i = (i + cq, (6) (13) 
provided that fx — h < d + Cq, (c) 
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where Vlt is the average size of all incumbent firms (see Appendix IA.2I) and Nt is the average 



Dunne et al. 



(|1988|) observe a slight decay 



number of incumbent firms, at time t. The fact that 
in the relative size of entrant firms to incumbent firms suggests that the condition of sustainable 
growth fi — h > d + cq holds. Under this hypothesis, the calibration of equation (fT3la) by OLS 
gives, on an annual basis. 



fx - h - d - Co = 1.8% (1.2%) 



and 



ji — h — d — Cq 
d+h 



28% (3%). 



(14) 



The figures within parenthesis provide the standard deviations of the estimates. As a conse- 
quence, the alternative hypothesis fi — h — d — CQ < cannot be rejected at any usual significance 
level and we cannot affirm that Dunne's data corresponds to the regime fi — h — d — co>0. 

Under the second hypothesis ^ — h = d — CQ, equation (fT3lb) leads to test the null hypothesis 
that the slope of the OLS regression of the logarithm of the size of entrant firms relative to the 
size of incumbent firms against the logarithm of time is equal to —1. Instead, we estimate a 
slope equal to —0.086 (0.068), which is therefore not significantly different from zero. Thus, 
we reject the hypothesis fi — h = d — Cq. 

According to equation (fT3lc). the size of entrant firms relative to the size of incumbent firms 
is constant over the period under consideration. To formally test this hypothesis, we perform 
the OLS regression of the size of entrant firms versus to the size of incumbent firms against 
time. We find that the hypothesis of a time dependent ratio of the size of entrant firms relative to 
the size of incumbent firms is rejected at any usual significance level. We thus have to conclude 
that the third alternative actually holds and we get 



d + h — fi + Cq 
d + h 



25%. 



(15) 



Dunne et al. 



(|1988r) . we obtain d + h — fx + cq 



With the figures d = and h = 5% obtained from|j 
1.25% and — cq = 3.75%. Thus, the balance condition is not strictly satisfied but the observed 
departure from the balance condition remains weak. 

To sum up, reasonable estimates of the key p arameters are h = A — 6%, d = ±0.5%, 



/i — Co = /i ± 2%. As for cr. 



Buldvrev et al. 



(|1997|) report the standard deviations of the growth 
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rates in terms of sales, ass ets, cost of goods sol d and plant property and equipment for US 



publicly-traded companies. iBuldyrev et al. 



(|l997h find that a ranges typically between 30% to 
50%. Based upon this set of figures, relation dV]) leads to a tail index m ranging between 0.7 and 
1.3, in agreement with the range of values usually reported in the literature. 

Proposition[T] states that the asymptotic power law of the distribution of firm sizes can be ob- 

r 2 1 ^^/^ 

served if the age of the economy is large compared with — ^ — cq j +2cr^((i + /i) 
With the set of parameters above, this corresponds to economies whose age is large compared 
to 5 to 12 years. 



3 Discussion 



3.1 Comparison with Gabaix's model 



Gabaixl (|l999h in its basic model, which 



Corollary[i|seems reminiscent of the condition given by|< 
relies on the argument that, because they are all born at the same time, firms grow - on average 
- at the same rate as the overall economy. Consequently, when discounted by the global growth 
rate of the economy, the average expected growth rate of the firms must be zero. Applied to 
our framework, and focusing on the distribution of discounted firm sizes, this argument wou ld 



Gabaixl (Il999h 's 



lead to fi = h, with d = cq = ci = in order to match Gabaix's assumptions, 
condition would thus seem to be equivalent to our balance condition for Zipf 's law describing 
the density of firms' sizes to hold. 

Actually, this reasoning is incorrect. Consider the case where fi > h, such that th e global 
econo my grows at the average growth rate re = — h according to Proposition [2l 



Gabaix 



19991) proposed to measure the growth of a firm in the frame of the global economy. In this 



moving frame, the conditional average growth rate of the firm is fi' = fx — re = h, which 
indeed would suggest that the balance condition is automatically obeyed when yU is replaced 
by fi'. But, one should notice that fi' is a transformed growth rate, and not the true rate. The 
average growth rate tq = ji — hoi the global economy is micro-founded on the contributions of 
all growing firms. It would be incorrect to insert /x' in the statements of Proposition 1 , as /x' is the 
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effective growth rate resulting from the change of frame, while our exact derivation requires the 
parameters jj, and h for Proposition 1 to hold. As such, nothing in our model a utomatically se ts 



the growth rate /i of firms to their death rate h, contrarily to \ yhat happens i n 



model. The main difference that invalidates the application of 
stochastic flow of firm's births and deaths. 



Gabaix 



Gabaix 



1999h 's 



(|1999|) 's argument is the 



It is important to understand that in iGabaixl (|l999r) 's basic model, the derivation of Zipf's 
law relies crucially on a model view of the economy in which all firms are born at the same 
instant. Our approach is thus essentially different since it considers the flow of firm births, as 
well as their deaths, which is more in agreement with empirical evidence. Note also that the 
available empirical evidence on Zipf's law is based on analyzing cross-sectional distributions of 
firm sizes, i.e., at specific t imes. As a consequence, the change to the global economic growth 



frame, argued by 



Gabaix 



(|1999f) . just amounts to multiplying the value of each firm by the 
same constant of normalization, equal to the size of the economy at the time when the cross- 
section is measured. Obviously, this normalization does not chan ge the exponent o f the power 



law distribution of sizes, if it exists. Furthermore, elaborating on 



KrugmanI (Il996h's argume nt 



about t he non-convergence of the d istribution of firm sizes toward Z ipf's law in 



model. 



Blank and SolomonI d2000) have shown that 



Simonl(ll955b 's 



Gabaixl (Il999l) 's argument suffers from a 



more technical problem. Based on the demonstration that the two limits, the number of firms 



^ oo and SmMIVLit) -)> C 



commu t ative. 



Blank and Solomon 



(or eq uivalently the limit of large times t — )• oo) are non- 
J2OO0I) showed that Zipf's exponent m = 1 as obtained by 
Gabaixl (Il999t) 's argument requires (i) taking the long time limit s^i^(t) /VL{t) — )■ over which 
the economy made of a large but finite number A^ firms grows without bounds, while simul- 



taneously obeying the condition (ii ) A^ ::g> eyi\)\VL{t) / Srnmit)]. The problem is that conditions 



(i) and (ii) are mutually exclusive. 



Blank and SolomonI (|2000|) showed that this inconsistency 



can be resolved by allowing the number of firms to grow proportionally to the total size of the 
economy. 

In a generalized approach of his basic model, Gabaix accounts for the app earance of new 
entities with a constant rate u (equal io d + h with our notations) and shows (Gabaix. ,1999 . 
Proposition 3) that, as long as this birth rate is less than the growth rate 7 of existing entities (/z 



"^The term Q,{t) refers to the average size of the economy defined by the sum of the sizes over the population of 
incumbent firms (see ( |69] l in appendix |A.2| i. 
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or /i — /i in our notations), the results of his basic model holds, i.e., Zipf 's law holds. On the 
contrary, he shows that the tail index of the size distribution is equal to m given by (|7]) when 
the birth rate of new entitie s is large r than their growth rate. This result seems in contradiction 



with ours, as well as with 



Luttmeii (120071) 's results, insofar as Proposition [T] states that m is 



the tail index of the size distribution irrespective of the relative magnitude of the birth rate of 
entrant firms and of the growth rate of incumbent ones. The discrepancy between these two 
results comes from an error in Gabaix's proof of Zipf's law in the regime when the birth rate 
of new entities is less than the growth rate of existing entities. The error consists in assuming 
that young firms do not contribute at all to the shape of the tail of the size distribution when u 
is less than 7 If]. Therefore, in the presence of firm entries, Gabaix's approach does not allow to 
explain Zipf's law. 



3.2 Comparison with Luttmer's model 



Based upon s tructural models, an important modeling strateg y has bee n developed, starting from 



(12007a 



LucasI ( 19781) and evolving to the more recent lLuttmeii (12007 , 



20081) or 



Rossi-Hansberg and Wright 



models. The distribution of firm sizes then appears as one of the properties of a 



general equilibrium model, which depends on different industry parameters. In these models. 



Zipf's law is obtained as a limit case, needing a rather sharp fine tuning of the control 



eters. Rossi-Hansberg and Wright's model is a "one firm" model as in 



Gabaixl (Il999h and is 



param- 



therefore subjected to the same restrictions. We do not discuss further this model in light of the 
results of our reduced form model. In contrast, the assumptions underpinning Luttmer's model 
match the assumptions under which proposition[T]holds, which motivates a closer comparison. 



^ To show that Zi pf's law holds as long as the birth rate of new entities is less than the growth rate of existing 
entities, iGabaixl (fl999. Appendix 2) splits the population of cities in two parts: the old ones, whose age is larger 
than T = t/2, and the young ones, whose age is smaller than T = t/2. In the limit of large time t, he shows that 
the size distribution of old cities should follow Zipf's law as a consequence of the results derived from his basic 
model. Then he provides the following majoration of the size distribution of firms born at time t > t/2, i.e., for 
young firms: Pr [S > sjbirthdate = t] < E[S'|birthdate — t]/s. This trivial inequality requires the expectation 
E[5|birthdatc = t] be finite. Thus, Gabaix's derivation crucially relies on the fact that the firms whose ages are 
slightly larger than t/2 are old enough for Zip's law to hold (and thus for E[5|birthdate < t/2] to be arbitrary 
large and infinite for an arbitrarily large economy which allows for the sampling of the full distribution), while the 
firms whose ages are slightly less than t/2 are not old enough for Zipf's law to hold, and therefore they still admit 
a finite average size: E[5|birthdate > t/2] < 00. It is clearly a contradiction as one cannot have simultaneously 



E[5|birthdate = {t/2)+] < 00 and Zipf's law for times {t/2)-. This invalidates eq. (17) in lOabaixl (Il999h 
because the integrand have to diverge as r — > T, with the notations of Gabaix article. 
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Luttmeij (120071) considers an economy of firms with different ages. For a firm of age a, its 
size Sa follows a geometric Brownian motion 



dlnSa = fJ- ■ da + a ■ dWa (16) 

where the drift ^ and volatility a are derived from a micro-economic model and are related to 
the price elasticity of the demand for commodity, to the rate 6'^; at which the productivity 
of entering firms grows over time, to the trend Oj of log productivity for incumbent firms, and 
to the volatility az of the productivity: 

/i = (0, ~9e), a = Y^crz. (17) 

Due to the presence of fixed costs, incumbent firms exit when their size reaches a constant 
minimum size b and in this case only. In our notations, this implies ci = and h = 0. In 
addition, Luttmer assumes that the overall number of incumbent firms grows at a rate r] > 
/i + so that the size of a typical incumbent firm is not expected to grow faster than the 
population growth rate. Within our framework, the number of firms grows, on the long run, 
at the rate d, so that we have the correspondence d = rj. Finally, Luttmer considers that firms 
enter either with a fixed size or with a size taken from the same distribution as the incumbent 
firms; consequ ently, in our not ations, we have cq = 0. Then, by application of propositionfH we 



conclude, as in 



Luttmer ( 



2007L section IILB), that the size distribution of firms follows a power 



law with a tail index given by 



Notice that Luttmer only considers the long term distribution of firm sizes, while our result 
allows considering the transient regime which eventually leads to the power law. In particular, 
accounting for the transient regime avoids resorting to the assumption r] > fi + cr^/2. Indeed, 
in Luttmer' s model, this assumption ensures that the tail index m remains larger than one so 
that, detrended by the overall growth of the number of firms given by e^''*, the average firm 
size is finite. This is a natural requirement if the economy is assumed to be finite. This latter 
assumption is more questionable in economies of infinite duration. In contrast, when finite 
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time effects are considered as in our framework, the average firm size is always finite for finite 
times, since the density of firm sizes decays faster than any power law beyond the intermediate 
asymptotic described by the power law, whether r] > fx + or not. In other words, the 
power law is truncated by a finite time effect, as derived in appendix lA.il As time increases, the 
truncation recedes progressively to infinity, thus enlarging the domain of validity of the power 
law. The exact power law distribution is attained therefore for asymptotically large times (we 
come back to this point latter on in section HTII) . Therefore, the constraint r] > fi + is 
not necessary anymore in our framework, since there is no reason for the average firm size to 
remain finite at infinite times when the size of the overall economy becomes itself infinite. 

The endogeneization of the growth rate of the productivity of entrant firms performed by 
Luttmer in the second part of his article does not match our assumptions, so that we cannot 
proceed further with the comparison of his results with ours. Indeed, in Luttmer's case, the 
upper tail of the distribution of entrant firms behaves as the tail of the distribution of incumbent 
firms, so that as sumption [2] is not satisfied. 



4 Miscellaneous results 



4.1 Distribution of firms' age and declining hazard rate 



Brudel et al.l (|l992|) . ICavesI (|l998L and references therein) or Dunne et al.l (|l988l Il989|) . among 
others, have reported declining hazard rates with age. Under assumption |5l the hazard rate is 
constant, which seems to be counterfactual. However, we now show that the presence of the 
lower barrier below which firms exit allows to account for age-dependent hazard rate. 

Let us denote by 9 the age of a firm at time t, i.e., the firm was bom at time t — 9. Expression 
(|52|) in appendix I A . 1 1 allow s us to derive the probability that, at time t, a firm older that 6 is still 
alive, which corresponds to the distribution of firm ages. Indeed denoting by 6t the random age 
of the considered firm at time t. 



Ft 



Qt>0 



-if 



•^min (^) 



In 



At) 



ds. 



(19) 
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where -ip 



is the size density of firms of age 9 at time t. Some algebraic 



manipulations give 
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In p{t) + {6 - 1 - 6o)t 



- Pit) 



l-S+5o 



erfc 



2VT 

lnp{t)-{6-l-6o)r 
2v^ 



(20) 
(21) 



with r := 5 := f and 5o := ^• 

Accounting for the independence of the random exit of a firm with hazard rate h from the 
size process of the firm (as sumption [5]), the "total" hazard rate reads 



Hit, 



(ilnPr 



Qt>9 
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In 



h + 



S0(t) 



-^min (^) / V '^min(0 



sojt) 
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4t 
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-^^^ -erfc 



■^min (^) 



2v^ 



(22) 



^3) 



assuming, for simplicity, that the random variable sq reduces to a degenerate random variable 
So- Expression (|23l) shows that the failure rate actually depends on firm's age. It also depends 
explicitly on the current time t through the ratio '^"^9,. . 

•^min \J^) 

Let us focus on the case cq = Ci, which corresponds to the same growth rate for so{t) and 
si{t). This allows considering arbitrarily old firms since, according to (lH), the starting point of 
the economy can then be = — oo. We obtain the limit result 




/i - ci - ^ > 0, 
/i - ci - 4 < . 



(24) 



In the moving frame of the exit barrier, /x — ci — ^ is the drift of the log-size of a firm 



d\nS{t) = ( /i-ci - y ) dt + cxdW{t). 



(25) 



Thus, when the drift is positive, the firm escapes from the exit barrier, i.e., its size grows almost 
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surely to infinity, so that the firm can only exit as the consequence of the hazard rate h. On 
the contrary, when the drift is non-positive, the firm size decreases and reaches the exit barrier 
almost surely, so that the firm exits either because it reaches the exit barrier or because of the 
hazard rate h. Hence the result that the asymptotic total failure rate is the sum of the exogenous 



failure of a firm when it reaches the minimum efficient size in the absence of h 



hazard rate h and of the asymptotic endogenous hazard rate ^ (jj — ci 



related to the 



Differentiating the age-dependent hazar d rate given by (|23l) with respec t to 6 and using the 



asymptotic expansion of the error function (lAbramowitz and Stegunl 



19651) . we get 



1 



den{t, I 



2a2 
3a^ 



II- Ci 



9^ 



a 
a' 



■nt, 



1 + 



/i - ci - ^ > 0, 



/i - ci - ^ < 0, 



(26) 



which shows that the total failure rate decreases with age, at least for large enough age, in 
agreement with the literature. 



4.2 Deviations from Zipf 's law due to the finite age of the economy 

Considering, for simplicity, that sq is a degenerate random variable such that Pr[so = sq] = 1^ 
we can determine the deviations from the asymptotic power law tail of the mean density of firm 
sizes (given explicitly by (l66l) in appendix I A. II) due to the finite age of the economy. For this, 
it is convenient to study the s-dependence of the mean number of firms whose sizes exceeds a 
given level s: 

roc 

N{s,t) = / g{s',t)ds' . (27) 

J s 

Zipf 's law corresponds to N{s, t) ~ for large s. 

All calculations done, defining so{t) := sqc^"'^ as being the initial size of an entrant firm at 

time t when Pr[5o = sq] = 1, we obtain the number N(k, t) of firms whose normalized size 

^Mathematically speaking, this hazard rate can be derived form the generic formula that gives the probability 
that a Brownian motion {XtJ^^Q with negative drift, started from Xq > 0, crosses for the first time the lower 
barrier X — 0. 
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Figure 2: The figure quantifies tlie deviations from Zipf's law resulting from the finite age of 
the economy, by showing the mean number N{k, t) of firms of normalized size s/ so{t) larger 
than K as a function of k, for parameters /i = cq, + /i = (satisfying the balance condition), 
and for sq = 100 ■ Smin, Cq = Ci = and reduced times r := = 5; 10; 50. The exact 

asymptotic Zipf's law ~ is also shown for comparison. 



is larger than k at the standardized age r := ^9, 



N{k, t) = B_ K-'- + 5+ - C , 



(28) 



where 



1 



C :-- 



2r] 



erfc 
erfc 
erfc 



-r)T 



In K — ra(?7) 
In /t + ra^f]) 

In /t — ra 
2v^ 



erfc 



sojt) 

'min(^) 



■^min (t) 
min (^) 



erfc 



/ ln(/tp 



2v^ 



V 2v/^ 



(29) 



and g±:= ^[a±a (r/)], a := 2 ■ ^ - 1, a{r]) := ^ + 4//, r] := ^ {d + h). 

Figure [2] shows the mean cumulative number A^(k, r) of firms as a function of the normal- 
ized firm size k, for /i = cq and /i = —d > satisfying to the balance condition of corollary [U 
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for So = 100 ■ Smin, cq = Ci = and reduced times r = 5, 10, 50. As expected, the older 
the economy, the closer is the mean cumulative number N{k, t) to Zipf's law N{k, oo) ~ k^^. 
Beyond r = 50, there are no noticeable difference between the actual distribution of firm sizes 
and its asymptotic power law counterpart. This illustrates graphically the last point discussed 
in remark |5] that, the larger the volatility (beyond some threshold), the faster the convergence 
of the size distribution toward the asymptotic power law. Indeed, the larger the volatility, the 
smaller the age 9 necessary to reach a value of r close to 50. 

The downward curvatures of the graphs for all finite r's show that the apparent tail index 
can be empirically found larger than 1 even if all conditions for the asymptotic validity of Zipf's 
law hold. This effect could provide an explanation for some dissenting views i n the literature 



Cabral and Matal ( 2003 ) and 



Eeckhout 



about Zipf's law. The two recent influential studies by 

( 20oJ ^liave suggested that the distribution of firm and of city sizes could be well-approached 
by the log-normal distribution, which exhibits a downward curvature in a double-logarithmic 
scale often used to qualify a power law. Our model shows that a slight downward curvature 
can easily be explained by the partial convergence of the distribution of firm sizes toward the 
asymptotic Zipf's law due to the finite age of the economy. 

It is interesting to note that two opposing effects can combine to make the apparent exponent 
m close to 1 even when the balance condition does not hold exactly. Consider the situation 
where e := < 1. For e < 1, figure [T] shows that m is always less than one. But, figure [2] 
shows that the distribution of firm sizes for a finite economy is approximately a power law but 
with an exponent larger than one for the asymptotic regime of an infinitely old economy. It is 
possible that these two deviations may cancel out to a large degree, providing a nice apparent 
empirical Zipf's law. 



4.3 Representativeness of the mean-distribution of firm size 

All our results have been established for the average number N(s,t) of firms whose size is 
larger than s, where the average is performed over an ensemble of equivalent statistical real- 
See the comment by Lew '('2009') which suggests that the extreme tail of the size distribution is indeed a power 
law and the reply by Eeckhout (,2009,) . 
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izations of the economy. Since empirical data are usually sampled from a single economy, it is 
important to ascertain if the average Zipf's law accurately describes the distribution of single 
typical economies. The answer to this question is provided by the following proposition whose 
proof is given in appendix I A. 3 [ 

Proposition 4. Under assumptionsUlcmd^ the random number N{s, t) affirms whose size is 
larger than s in a given economy follows a Poisson law with parameter N{s, t) (defined in 
with (|?5l) and ^^): 



Pr 



N{s,t) = n 



N{s,ty 



-N{s,t) 



(30) 



As a consequence of proposition HI we state 
Corollary 2. Under the assumptions of proposition H] the variance of the average relative 

N{s,t) 
7V(s,t) 



distance t77t4t — 1 between the number of firms in one realization and its statistical average is 



given by 



E 



N{s,t) 



- 1 



1 



N{s,ty 



(31) 



Proof The left hand side of the equation above is nothing but the variance of N{s, t) divided 
by N{s, ty. Since, N{s, t) follows a Poisson law, Var iV(s, t) = N{s, t), hence the result. □ 

To give a quantitative illustration, let us consider firms whose sizes evolve according to the 
pure Geometric Brownian Motion, i.e., Pr[so = sq] = 1, cq = ci = h = d = and no minimum 
exit size. Then, N{s, t) = N(s) = g{s')ds', where 



1^0 



■2 I ■'0 



S > So 



a 



(32) 



This expression derives from the general expression (l66l) for Zipf's law given in appendix lA.il 
in the limit Smin — ^ 0. This leads to 



N{s) = No (^) 



. 1 2m 



(33) 



where 



Nn 



/ 9{s)ds = — 

Jso 



(34) 
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10° 10' 10^ 10^ 10" 10= 10° 10 

s/s„ 

Figure 3: Number of firms whose size is larger than s when a = 0.01, z/q = 50 and /i = for 
ten realizations of the economy. The straight red line depicts Zipf's law for the mean number 
of firms. 

is the mean number of firms, whose sizes, at a given time t, are larger than the initial size 
sq. Using (I3TI) for the variance of the relative distance between the number of firms in one 
realization and its statistical average in Corollary 2, and with (|33l) . we obtain that the variance 
of the relative distance is given by ^, where we have assumed that /i = 0, so that Zipf's law 
N(s) ~ s^^ holds for the mean distribution of firm sizes. 

In this illustrative example, the total number of firms is infinite while Nq remains finite. Let 
us consider a data set spanning the range s G (sq, s^,) where s* = 0.01 Nq sq is such that the 
variance of the relative distance between the number of firms in one realization and its statistical 
average remains smaller than 10^^ over the range s E (sq, s*). Suppose that the mean number 
of firms in the economy, whose sizes are larger than sq, is equal to Nq = 10®. Then = 10"^ sq, 
showing that Zipf's law should be observed, in a single realization of an economy, with good 
accuracy over four orders of magnitudes in this example. Figure|3]depicts ten simulation results 
obtained for such an economy. 
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5 Conclusion 



We have presented a general theoretical derivation of Zipf's law, which states that, for most 
countries, the size distribution of firms is a power law with a specific exponent equal to 1: 
the number of firms with size greater than S is inversely proportional to S. Our framework 
has taken into account time- varying firm creation, firms' exit resulting from both a lack of 
sufficient size and sudden external shocks, and Gibrat's law of proportional growth. We have 
identified that four key parameters control the tail index m of the power law distribution of 
firms sizes: the expected growth rate /j, of incumbent firms, the hazard rate h of random exits 
of firms of any size, the growth rate Cq of the size of entrant firms, and the growth rate d 
of the number of new firms. We have identified that Zipf's law holds exactly when a balance 
condition holds, namely when the growth rate rf+co of investments in new entrant firms is equal 
to the average growth rate fi — h of incumbent firms. Thus, Zipf's law can be interpreted as 
a remarkable statistical signature of the long-term optimal allocation of resources that ensures 
the maximum sustainable growth rate of an economy. We have also found that Zipf's law is 
recovered approximately when the volatility of the growth rate of individual firms becomes very 
large, even when the balance condition does not hold exactly. We have studied the deviations 
from Zipf's law due to the finite age of the economy and shown that a deviation of the balance 
condition d + cq = jj, — h can be compensated approximately by the effect of the finite age of 
the economy to give again an approximate Zipf's law. We have also shown that the presence 
of a minimum size below which firms exit allows us to account for the age-dependent hazard 
rate documented in the empirical literature. Our results hold not only for statistical averages 
over ensemble of economies (i.e., in expectations) but also apply to a single typical economy, 
as the variance of the relative difference between the number of firms in one realization and its 
statistical average decays as the inverse of the number of firms and thus goes to zero very fast 
for sufficiently large economies. Therefore, our results can be compared with empirical data 
which are usually sampled for a single economy. Our theory improves significantly on previous 
works by getting rid of many constraints and conditions that are found unnecessary or artificial, 
when taking into account the proper interplay between birth, death and growth. 
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A Appendix 



A.l Derivation of the distribution of firms' sizes: proof of proposition [H 

Consider an economy with many firms born at random times U > to, i E N, where to is the 
starting time of the economy. We assume that no two firms are born at the same t i me so that 



2007 



, def. 



the random sequence {tjjieN defines a simple point process (|Daley and Vere- Jones . 
3.3.II). 

Let Si{t), i E N, t > to be a positive real- valued stochastic process representing the size, 
at time t, of the firm bom at tj. Qbvioi isly, Si(t) = 0, Vt < U. Th e sequence {tj, ^^(t)}^^^ 



defines a simple marked point process (iDaley and Vere- Jones 



2007L def. 6.4.1 - 6.4.II) with 



ground process {tjjieN and marks {^^(t)}^^^- We assume that {U} and {Si{t)} are mutually 
independent and such that the distribution of Si (t) depends only on the corresponding location 
in time U. Consequently, the mark kernel Fm,i{s, t) := Pr [Si{t) < s] simplifies to Fm (s, t\ti). 

For any subset T x S of [to, oo) x ]R_|_, we introduce the counting measure 



The total number of firms whose sizes are larger than s at time t then reads 



(35) 
(36) 



N{s,t) := Nt{[to,t) X [s,oo)), 

Nt{du X ds), 

[to,t)x[s,oo) 
ieN: ti<t 



As a consequence of theorem 6.4.IV.C in Daley and Vere-JonesI (|2007l) we can state that 



(37) 
(38) 

(39) 



Lemma 1. Provided that the ground process {ti}i^m admits a first order moment measure with 
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density vit) w.r.t Lebesgue measure, the counting process N{s, t) admits a first moment 



N{s,t) 



E 



iV(s,t)J , 
/ [1 — Fm {s,t\u)] ■ iy{u) du. 

Jtn 



(40) 
(41) 



Remark 7. When the ground process is an (inhomogeneous) Poisson process, z/(t) is nothing 
but the intensity of the process. 



Proof. By theorem6.4.IV.cin 



Daley and Vere- Jones 



(|2007|) . the first-moment measure Mi ( 



E [Ai'((-)] of the marked point process {tj, S'j(t)}jgN exists since the corresponding moment mea- 
sure exists for the ground process {tjjieN- It reads 



Ml {du X ds) = h'{u)du ■ Fm{ds,t\u). 



(42) 



As a consequence 



N{s,t) = E 



Nt{du X ds) 



_J [to,t)x [s,oo) 



Ml {du X ds) 



[to,i)x [s,oo) 
t 



(43) 



/ v{u)du ■ Frauds, t\u) = I [1 — Fm {s , t\u)] ■ v {u) du . (44) 

J [tQ,t)x[s,<X)) J to 



□ 



As an immediate consequence, provided that S{t) admits a density /m(s, t\u) with respect 
to Lebesgue measure, the counting process iV(s, t) admits a first-moment density 



g{s,t) := fm{s,t\u) ■ iy{u)du. 

J to 



(45) 



This first-moment density does not sum up to one but to a value Noo{t) = lim^^^o N(s, t), which 
remains finite for all finite t. A sufficient condition is that the growths of the number of firms 
and of their sizes are not faster than exponential in time, in agreement with condition (ii) in 
proposition [H Many faster-than-exponential growth processes of the number of firms and of 
their sizes are also permitted, as long as they do not lead to finite-time singularities. 
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Lemma 2. Under the assumptions\T\ |2]anJ|5l the first-moment density of sizes of all the firms 
existing at the current time t reads 

g{s,t)= [ u{u)e-''-^^-''^f{s,t\u)du , t > to , (46) 



J to 

where to(> i^*) is the starting time of the economy (with given by ^) and f{s,t\u) is the 
probability density function of a firm's size at time t and born at time u. 

Proof. Assumptions [U and [2] are enough for lemma [U to hold. Besides, by assumption [51 the 
exit rate of a firm is independent from its size so that /m(s, t\u) = e"'^'^*""^ ■ /(s, t\u), where 
f{s,t\u) denotes the probability density function of a firm's size at time t and born at time 
u. □ 

Lemmas [H and |2l show that, in order to derive proposition [H we just need to consider the 
law of a single firm's size, given that it has not yet crossed the level Smin(^)- The density of a 
single firm's size, that is solution to equation ([3]) embodying Gibrat's law, for a firm born at time 
ti = t — 9i and given the condition that the firm's size Si{t, 6i) is larger than Smin(i), V6'j > 0, 
is given by the following result. 

Lemma 3. Under the as sumptions^\^and^ the probability density function f {s,t\t — 6,sq = sq) 
of a firm's size at time t and aged 9 conditional on sq = sq, taking into account the condition 
that the firm would die if its size would reach the exit level Smin(t), is 



f {s,t\t - 9,so = So) 



^ exp|--fln('^^Vlnf^#TV(^-l-^o)r'^ 



Sminif) J ySminif ) J \ 4t y \'5min('^)/ \'5min('^) 

(47) 
where 

soit) := Soe^-\ r := ^ 9 , 5:=^, <5o := ^ , := ^ • (48) 

2 

Proof. Let us consider a firm born at time u = t — 9, where t denotes the current time and > 
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is the age of the firm. The firm's size S{9, u) is given by the following stochastic process 



(49) 



where 9 = t ~ u, W{9) is a standard Wiener process, while So{u) is the initial size of the firm, 
given So = Sq, and c := n — The process (|49l) with the initial and boundary conditions in 
assumptions [2l and |4] can be reformulated as 



S{9,u) = Sr^U^ + 9)e^^''^'^ 



(50) 



where 



Z{9, u) = \np{u + 9) + {c- ci)9 + aW{9), p{t) :-- 



■^min (^) 



As a consequence. 



(51) 



f{s,t\u,so = So) = -ip 



In 



At) 



,9;u 



where ip{z; 9, u) denotes the density of Z{9, u) which is solution to 

dip{z;9,u) d(p{z;9,u) a"^ d'^(p{z;9,u) 

89 + ~ ""'^ Fz " y d? 

ip{z; 9 = 0,u) = 6{z — In p{u)) , 
f{z = 0;9,u) = , 9>0. 



(52) 



(53) 



These initial and boundary conditions are equivalent to the initial and boundary conditions in 



assumptions [2] and m Using any textbook on stochastic processes (Rednen . 
we get 



2001 



, for instance), 



f{z;9,u) 
[p{u)r 



5i-<5+l 

exp 



exp 



{z-\np{u) - (5 - l-(5i)r)2 

{z + \np{u) -{6-1- (5i)r)^ 
2y/7iT \ 4r 

where 6 and r are defined in (l48l) . Taking into account the relation 



(54) 



p{u) = p(t)e(^i-^«)" 



(55) 
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we rewrite expression (I54|) as 



1 f {z-\np{t)-{5-l-5,)Tf 

Lp{z; 9, u) = exp ' 



By substitution in (l52l) . this concludes the proof of Lemma 3. □ 

Performing the change of variable from birthdate u to age 9 = t — urn (l46l) . and accounting 
for assumptionfH i.e. the fact that z/(t) = uq e"^ * , leads to 

g{s, t) = uit) r e-^'^'^^'E [f {s, t\t - 9, 5o)] d9 , (57) 
Jo 

where = ^ — is the age of the given economy. E [/ (s; t, \t — 9, sq)] denotes the statistical 
average of f (s;t,9\sQ) over the random variable sq. Inasmuch as to should not be smaller than 
U given by dH), we should thus have 9q < 9^ ■- 



As a byproduct, the mean density of firm sizes, conditional on sq = sq is 

g (s, t\So = So) = u{t) / e-('^+'^)V (s, t\t - 9, So = so) d9 . (58) 

Jo 



Thus, substituting (gTj) into (158]) yields 

g{s, t\~so = so) = ^G fin (^j^) ; t, To) , z>(t) = ^ , (59) 

with 

G(z; t, To) := [ ° e-^X^; t, r)dT , (60) 
where ip{z;t,9) is given by (l54l) while 



2 

ro:=— ^0 (ro<T-,), r] := — [d + h) . (61) 



The substitution of (p{z; t, 9) from (|54l) into the integral (|60]) leads to two integrals, which can 
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be reduced to 



X(2, 6', a, /3) := J exp ^- 



[z — ar] 



/3r 



dr 



At ' J 20rr 

whose expression can be obtained by the tabulated integral (7.4.33) in 
(|l965|) by the change of variable u = i/r. This leads to 



(62) 



Abramowitz and Stegun 



G{z; t, To) 



Toa{r]) 



1 



Pity 



with 



,^{az+-a{r))\z+\) 



2^^ 

\z+\ - roa{r]) 
2v^ 



Mv) 

gi(«._+a(,)|._|)gj.f^ / \z-\+Toa{v) 



— g5("2++"('?)|2+l)ei-fc 



\z+\ + Toa{r]) 
2v^ 



a := 5 — 1 — So , a{r]) := \/ ^ Ari , Z- := In 



So(t) 



(63) 



..:=ln^.(64) 



"Smin(^)^ 



For an old enough economy, i.e., when ^ !/«(?]), we can expand expression (l63l) to 
obtain 



1 
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p(t)-"e 



a i(a2+-Q!(»y)|2+|) 



(65) 



Substituting this last expression into equation (1591) for the mean density of firms sizes, and after 
making explicit the s-dependence of the variable z, we finally get 



g{s,t\so = So) 



_so(i) 



1 - 



5min(t) 



-"(»?) 



S > So{t) 



so{t) 



-a{ri) 



5 



So(t) > S > Si„in(t) 

(66) 



(t) 

for large tq 3> ^(r])"^, with so(t) = soe*^" *, as defined by (|48|) . 

According to assumption^ the expectation of g{s,t\so) with respect to sq provides us with 
the unconditional mean density of firm sizes 



1/™ oCo-i 



sairf) 



as s — oo and t — )■ oo 



(67) 
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where m is given by ([7]). This expression (l67l) justifies the statement of proposition [T] and 
concludes the proof. 



A.2 Growth rate of the overall economy: Proof of proposition |2] 



Using the same machinery as in appendix IA.1[ we define the total size of the economy at time 
t as 



n{t) := Yl 

ieN: ti<t 

= s ■ Nt{du X ds) . 
Jto 

Under the assumptions of proposition [H by theorem 6.4.V.iii in 
we get 



(68) 



Daley and Vere-JonesI (l2007h . 



Vt{t) := E Vt{t) 



/•TO 

u{t) / e-^-'E r)] rfr . 
Jo 



(69) 
(70) 



For simplicity, let us consider the case where Smin = 0. This assumption is not necessary, but 
greatly simplifies the calculation. Under this assumption, the size of an incumbent firm follows 
a geometric Brownian motion so that 



E[^(t,r)] = so(t)e 



(5-5o)r 



(71) 



where 5, 5q and So(t) are defined in (1481) . Substituting (TtTI) into (fTOl) gives 



dl{u) . 



(72) 
(73) 
(74) 
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This last equation shows that jj, — his the return on investment of the economy. By integration, 
we get the limit growth rate of the economy 



, d\nn{t) J^-^' f'-h>d + Co 

lim = <^ . (75) 



t^co dt 



d + co, n — h < d + cq 



This concludes the proof of proposition [21 when Smin = 0. When Smin 7^ and grows at the 
rate ci > 0, the result can still be proved along the same lines but at the price of more tedious 
calculations since the expectation in (TtTI) involves eight error functions. 



A.3 Representativeness of the mean-distribution of firm size: Proof of 
proposition |4] 



Dalev and Vere- Jones 



( 2007 ) which 



The proof of proposition |4] follows from lemma 6.4. VI in 
states that a marked point process that has mark kernel Fm{s, t\u), and for which the Poisson 
ground process has intensity measure u(u)du, is equivalent to a Poisson process on the product 
space with intensity measure A(du x ds) = F^ids, t\u) ■ v{u)du. 

Thus, under the assumptions [T] and [2l using the notations of appendix IA.1[ the marked 
point process {tj, 5'i(t)}jgN is a compound Poisson process with intensity measure A{du x ds). 
Consequently 



Pr 



N{s,t) = n 



PT[Nt{[to,t) X [s,oo)) = n] , 

I[to,t)x[s,oo)Prnids,t\u) ■ iy{u)duj 



exp 



Fm{ds, t\u) ■ iy{u)du 



[to,t)x[s,oo) 



nl 



(76) 



by lemma [B 
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